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(54) TRANSFORMED SILKWORM PRODUCING HUMAN COLLAGEN 



(57) The present invention provides a transfonmed 
silkwomn which has a polynucleotide encoding human 
collagen within the genomic DNA and produces recom- 
binant human collagen as a part of proteins in the co- 
coon or the silk gland; a process for producing recom- 
binant human collagen by using this transformed silk- 
worm; and a recombinant vector for use in the genera- 
tion of the transformed silkworm. Since human collagen 
is collected from the cocoon discharged by this trans- 
formed silkwonm or its silk gland, highly pure human col- 



lagen can be conveniently obtained In a large amount. 
Moreover, because the recombinant human collagen 
produced by the transfonmed silkworm Is safe collagen 
which is free from any fear of the contamination with 
pathogens such as viruses or prions and exhibits no an- 
tigenicity toward humans. Thus, it can be utilized in var- 
ious industrial fields including medicines, foods, cos- 
metics and the like. 
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Description 
Technical Field 

5 [0001] The invention of the present application relates to a transformed silkworm which produces recombinant human 
collagen. More particularly, the present invention relates to a transformed silkworm which produces recombinant human 
collagen; a recombinant vector for use in the generation of this transformed silkworm; and a process for producing 
recombinant human collagen. 

10 Baclcground Art 

[0002] Collagen is a major protein which constitutes extracellular matrices, and has various physiological functions 
controlling cell proliferation, differentiation, migration and the like in addition to mechanical functions to maintain the 
structure of a living organism by serving as a scaffold of cells. Therefore, collagen has been widely utilized in a medical 

IS field as a biomaterlal for repairing injury of a living organism (J. Surg. Res. 10: 485-491, 1970), or as a earner for 
sustained release of a certain type of drugs (J. Controlled Release 33: 307-31 5, 1 995). However, most collagen which 
is used at present is derived from an animal tissue such as one from a cattle or a pig. It is known that an allergic 
response occurs in patients accounting for about 3% when such collagen is transplanted to a human (J. Immunol. 136: 
877-882, 1986; Biomaterials 11: 176-180, 1990). Furthermore, dangers of the contamination with pathogens such as 

20 viruses or prions in collagen derived from animal tissues have been big problems In recent years. Thus, it has been 
desired to develop a system for producing recombinant human collagen without antigenicity and dangerous contami- 
nation of pathogen. Therefore, the inventors of this application achieved an invention and filed a patent application of 
a process for producing recombinant human collagen having a triple hellcat structure which is equivalent to that in a 
human living body through infecting a recombinant virus having an inserted cDNA encoding human collagen to insect 

25 cells (JP-A-8-23979). Moreover, a process for producing human collagen by using mammalian cells or yeast has been 
also proposed (JP-T-7-501937). 

[0003] As described above, processes in which insect cells, mammalian cells, or yeast are used have been proposed 
as the process for producing recombinant human collagen. However, according to the process in which insect cells or 
mammalian cells are used, to achieve high amount of the production is difficult which allows for utilization in a medical 
30 field. Further, according to the process in which yeast is used, the recombinant product is produced in the fungus 
bodies, therefore, purification of recombinant human collagen is not necessarily easy. 

[0004] The present invention was accomplished in light of the circumstances as described above, and an object of 
the invention is to solve the problems in the conventional art and to provide a process for producing recombinant human 
collagen with high productivity and the easiness in purification as well as the genetb engineering materials for the same. 

35 

Disclosure of Invention 

[0005] A first aspect of the present invention is a transformed silkwonn, which comprises a polynucleotide encoding 
human collagen within the genomic DNA and produces recombinant human collagen as a part of proteins in the cocoon 
40 or the silk gland. 

[0006] A second aspect of the invention is a transformed silkwonn, which comprises a polynucleotide encoding a 
fusion protein of human collagen with a silkworm silk protein within the genomic DNA and produces the fusion protein 

as a part of proteins in the cocoon or the silk gland. 

[0007] Exemplary transfomned silkwomis according to the first and second aspects of the invention may include adult 
45 silkwomns, larvae, pupae and eggs. 

[0008] Further, in a preferred embodiment of the transfonmed silkwomn according to the first and second aspects of 
the invention, a polynucleotide encoding at least one of the a subunit and the P subunit of prolyl hydroxylase is Included 
within the genomic DNA. 

[0009] A third aspect of the invention is a process for producing recombinant human collagen, which comprises 
50 isolating and purifying recombinant human collagen from the cocoon or the silk gland of the transformed silkworm of 
the first aspect of the invention. 

[0010] A fourth aspect of the invention is a process for producing recombinant human collagen, which comprises 
isolating a fusion protein of recombinant human collagen with a silkworm silk protein from the cocoon or the silk gland 
of the transfomied silkwomn of the second aspect, and isolating and purifying recombinant human collagen from the 
55 fusion protein. 

[0011] A fifth aspect of the invention is a fusion protein of recombinant human collagen with a silkwomn silk protein 
produced by the transformed silkworm of the second aspect of the invention. 

[001 2] A sixth aspect of the invention is a targeting vector derived from Autographa catifomica nuclear polyhedrosis 
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virus, which Is used for executing homologous recombination of a polynucleotide encoding human collagen to an 
arbitrary region In a sllkwomri genome, and producing a transfomned silkwonn which produces human collagen. In the 
invention, "polynucleotide encoding human collagen" refers to a nucleic acid molecule which expresses human colla- 
gen, and implies a human genomic DNA, an mRNA transcribed from a genomic DNA, a cDNA synthesized from an 
5 mRNA or the like. 

[0013] One embodiment of the targeting vector of the sixth aspect of the invention is that both ends of the polynu- 
cleotide encoding human collagen have a DNA sequence homologous to an arbitrary region of a genomic DNA en- 
coding a silkworm silk protein (silkwonn silk protein gene). This targeting vector executes homologous recombination 
of a polynucleotide encoding human collagen downstream of an expression regulatory sequence (promoter/ enhancer 

10 sequence) of a silk protein gene of a silkworm genome (i.e., endogenous). 

[0014] Another embodiment of the targeting vector of this sixth aspect of the invention is that both ends of a human 
collagen expression cassette have a DNA sequence homologous to an arbitrary region of a silkwomi genomic DNA. 
This targeting vector executes homologous recombination of the human collagen expression cassette to an arbitrary 
region of a sllkwonm genome. This "expression cassette" means a fusion polynucleotide including a polynucleotide 

IS encoding human collagen iigated under the control of an expression regulatory sequence of a silkworm silk protein 
gene. 

[0015] A seventh aspect of this invention is a targeting vector derived from Autographs catifornica nuclear polyhe- 
drosls virus for executing homologous recombination of a polynucleotide encoding at least one of the a subunit and 
the p subunit of prolyl hydroxylase to an arbitrary region of a silkworm genome. 

20 [0016] An eighth aspect of the invention is a recombinant plasmid vector having a human collagen expression cas- 
sette in a region sandwiched between a pair of inverted terminal repeats of DNA transposon derived from an Insect. 
[0017] A ninth aspect of the invention is a recombinant plasmid vector having a polynucleotide encoding at least one 
of the a subunit and the p subunit of prolyl hydroxylase to a region sandwiched between a pair of inverted tenminal 
repeats of DNA transposon derived from an insect. 

25 [0018] A tenth aspect of the invention is a set of vectors comprising a recombinant plasmid vector of the aforemen- 
tioned eighth aspect of the invention, and a recombinant plasmid vector having a polynucleotide encoding transposase 
of transposon. Through use of this set of vectors, the human collagen expression cassette is transferred to a silkworm 
genome and thereby a transfomned silkwonn is generated which produces human collagen. 

[0019] An eleventh aspect of the Invention is a set of vectors comprising a recombinant plasmid vector of the afore- 
30 mentioned ninth aspect of the invention, and a recombinant plasmid vector having a polynucleotide encoding trans- 
posase of transposon. Through use of this set of vectors of the eleventh invention and the set of vectors of the afore- 
mentioned tenth invention, a transfomned silkwonn which produces human collagen with hydroxyproline is generated. 
[0020] A twelfth aspect of the invention is a polynucleotide encoding the a subunit of silkworm prolyl hydroxylase 
having an amino acid sequence of SEQ ID NO: 2. Specifically, the polynucleotide is a DNA fragment having a base 
35 sequence set out In SEQ ID NO: 1 . 

[0021] In a prefen^ed embodiment of each invention of this application, the polynucleotide encoding the prolyl hy- 
droxylase a subunit is at least a coding region of a DNA fragment having the base sequence of SEQ ID NO: 1 . Fur- 
thermore, in a preferred embodiment, the human collagen expression cassette is a fusion polynucleotide including a 
polynucleotide encoding human collagen Iigated under the control of an expression regulatory sequence of a silkworm 
^ silk protein gene, or Is a fusion polynucleotide Including a polynucleotide encoding a silkworm silk protein and a poly- 
nucleotide encoding human collagen Iigated under the control of an expression regulatory sequence of a silkworm silk 
protein gene. 

Brief Description of Drawings 

45 

[0022] 

Fig. 1 Is a restriction enzyme map of a transfer vector pMOSRA-1 . 

50 Fig. 2 is a restriction enzyme map of mini collagen genes incorporated in a piggyBac plasmid vector (pMOSRA- 

4B, pMOSRA-5 and pMOSRA-6). 

Fig. 3 is an electrophoresis images of PGR products amplified with genomic DNAs extracted from positive F1 adult 
silkworms as a template. The number above images, e.g., 1 and 2 show numbers of positive silkworms. The lane 
55 M is for a 1 00 bp ladder marker. 

Fig. 4 is an electrophoresis images of fTT-PCR products amplified using an RNA extracted from the silk gland of 
positive F1 larval sllkwonns of the fifth-tnstar stage. The electrophoresis was conducted: in the lanes 1 and 2 for 
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RT-PCR products from positive silkworms; and In the lane C for RT-PCR products from a wild type silkwonn. The 
lane M Indicates a 1 00 bp ladder marker. 

Best Mode for Carrying Out the Invention 

5 

[0023] Collagen has been so far known as existing 1 9 different types including type I through type XIX. Any of these 
types of collagen is characteristic in that it is a trimer molecule fonned from three subunits (a chain), and has a triple 
helical structure within Its molecule. Among these collagens, some of them are synthesized as follows that procollagen 
that is a precursor is synthesized at first, and then converted to a mature collagen molecule. For example, fibrous 

10 collagen such as type I, II or III is synthesized as procollagen having an amino propeptide and a carboxyl propeptide 
with non-triple helical structure at an amino terminal end and a carboxyl temninal end of the triple helical region which 
is a main body thereof, followed by cleavage of both propeptides by specific protease to form mature collagen. In 
addition, collagen is subjected to a variety of posttranslational modifications including hydroxylation of proline, hydrox- 
ylation of lysine, oxidation of lysine and hydroxylyslne (modification to aldehyde) and the like during its blosynthetic 

IS pathway. In particular, hydroxylation of proline by prolyl hydroxylase is extremely important in achieving stability of 
collagen at a physiological temperature. 

[0024] Human collagen which is a subject matter of the invention may be any collagen including type I through type 
XIX collagens, and also includes any partial amino acid sequence of any one of these collagens. Further, those having 
partial alteration of the amino acid sequence of these collagens. and those having an added amino acid sequence 

20 which is not derived from collagen are also involved. Moreover, in addition to mature collagens, procollagens that are 
precursors and those having a cleaved part in a propeptide are also included. Additionally, in accordance with the 
invention, immature collagen molecules such as those with an incomplete posttranslational modification including hy- 
droxylation of proline and those with an Incomplete triple helical structure are also included in the subject matter 
[0025] Silkworms produce a cocoon through discharging a large amount of a silk thread when entered In the spinning 

25 stage. Major components of this srik thread are silk proteins that may include fibroin, P25 and sericin. The amount of 
synthesis of these silk proteins is enormous which reaches to about 0.5 g in average per one silkwonn. The silk proteins 
are synthesized in an organ which is referred to as a silk gland. The silk gland is constituted from a posterior silk gland, 
a middle silk gland and an anterior silk gland. Fibroin and P25 are specifically synthesized and secreted in the posterior 
silk gland, whilst sericin is specifically synthesized and secreted in the middle silk gland, respectively. Fibroin is a 

30 complex which is constituted from an H chain and an L chain, and P25 is further associated with this complex (J. Biol. 
Chem. 275: 40517-40528, 2000). Fibroin and P25 secreted from the posterior silk gland are gradually delivered by a 
peristaltic movement of the silk gland to the middle silk gland where they are covered circumterentlally by sericin 
secreted therefrom, and then discharged as a silk thread after being further delivered to the anterior silk gland. Hence, 
the silk gland of a silkworm is an organ having superior ability to synthesize proteins, and thus, when recombinant 

35 human collagen Is expressed In this organ, eminently high productivity can be expected. Moreover, it is markedly easy 
to collect and purify the synthesized recombinant human collagen from the cocoon discharged by the silkwomi, or from 
proteins in the silk gland because: the silk thread Is excreted from the body; only small kinds of the silk proteins constitute 
the silk thread; and fibroin which is present in a largest quantity among the silk proteins is insoluble in an aqueous 
solution. 

40 [0026] A transient expression system of a foreign gene in a silkworm is established in which a Bombyx mori nuclear 
polyhedrosis virus (BmNPV) is utilized as a vector (JP-B-7-97995). However, since this process is lethal to a silkwonn 
due to the infection with a virus, the foreign gene can not be transmitted to the following generations. Thus, the ex- 
pression of a useful protein is limited to one generation. Therefore, viral inoculation must be conducted at every time 
of allowing expression of the foreign gene. On the other hand, Mori et al. developed a process capable of introducing 

45 a gene without killing the silkworm, and transmitting the foreign gene to the next generation via the germ cells in 
instances of the infection to a female, by way of the infection to a larval silkworm with Autographs californica nuclear 
polyhedrosis virus (AcNPV) (JP-A-6-277051). Further, Yamao et al. succeeded in generating a transformed silkworm 
having a foreign gene inserted into a fibroin L chain gene of a silkworm genome by gene targeting through the infection 
of AcNPV having a part of a fibroin L chain gene sequence incorporated therein to a larval silkworm (Genes Dev. 13: 

50 511-516,1999). 

[0027] Further, Tamura et al. (Nat. Blotechnol. 18: 81-84, 2000) succeeded In generating a transformed silkworm 
having an inserted foreign gene by microinjection of a plasmid vector having incorporated piggyBac which is DNA 
transposon derived from a Lepidoptera insect, Trichopfusia nt, into a silkwonn egg. 

[0028] In the transformed silkwonns generated according to these processes, the inserted foreign gene is maintained 
55 within the chromosome without dropout, and persistently expresses a recombinant protein throughout the generations. 
[0029] In the invention, at first, a polynucleotide encoding human collagen is subjected to incorporation into AcNPV 
vector or a plasmid vector constructed on the basis of DNA transposon. Subsequently, a transformed silkworm is 
generated in which a polynucleotide encoding human collagen is Incorporated therein genomic sequence by transfor- 
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mation using any one of these vectors. 

[0030] The polynucleotide encoding human collagen which is capable for use may be a genomic DNA of human 
collagen, an mRNA or a cDNA synthesized from an mRNA. Preferably, cDNA rs used. Although instances In which a 
cDNA is used are primarily explained below, the polynucleotide used in the invention Is not limited to cDNAs. 

5 [0031] Human collagen cDNA may be any cDNA of type I through type XIX collagens. Base sequences of these 
cDNAs are available from infomnation described in a literature (e.g., Essays Biochem. 27: 49-67, 1992; Annu. Rev. 
Blochem. 64: 403-434, 1995). For example, each cDNA of any one of human type I through type XIX collagens can 
be obtained by a method in which a human cDNA library is screened using an oligonucleotide produced on the basis 
of any one of these cDNA sequences as a probe, or a PGR method in which oligonucleotides corresponding to both 

10 ends of the cDNA sequence are used as primers and a human DNA is used as a template, or an RT-PCR method in 
which an RNA extracted from a human cell is used as a template. 

[0032] In order to allow expression of a human collagen cDNA in a silkworm silk gland cell, expression regulatory 
sequences (e.g., promoter or enhancer) can be utilized which are derived from silk protein genes that may include a 
fibroin H chain, a fibroin L chain, P25 and sericin. Through construction of an expression cassette utilizing the promoter 
■ IS and enhancer of any one of these silk protein genes, recombinant human collagen can be expressed in a large amount, 
in a manner specific to the silk gland. For example, when a promoter and enhancer of the fibroin H chain, the fibroin 
L chain, P25 are used, expression of human collagen is enabled in a posterior silk gland, whilst when a promoter and 
enhancer of sericin are utilized, expression of human collagen Is enabled In a middle silk gland. The human collagen 
expression cassette may also penmit the synthesis of a fusion protein of human collagen with a silkworm silk protein 

20 by producing a fusion polynucleotide of human collagen cDNA with silk protein cDNAs which may include the fibroin 
H chain, the fibroin L chain, P25 and sericin, and incorporating this fusion polynucleotide into a siikworm genomic 
sequence in order to facilitate a secretion and silk thread discharged from the silk gland cells. In this instance, the 
fused protein may be a partial amino acid sequence of a silk protein, or may be a full length amino acid sequence. For 
example, when a fusion protein having a signal peptide of human collagen substituted for a signal peptide of a silk 

25 protein is synthesized, secretion of human collagen from the silk gland cells can be facilitated. In addition, when a 
fusion protein of human collagen with the full length of fibroin L chain is synthesized for example, thus synthesized 
fusion protein of human collagen with the fibroin L chain fomns a complex with the fibroin H chain via a disulfide bond. 
Accordingly, more effteient secretion can be effected. 

[0033] in instances where the introduction system of the gene is a gene targeting method with AcNPV vector, a 
30 human collagen cDNA can be incorporated into a silkworm genomic DNA by homologous recombination through con- 
structing a targeting vector by incorporating a human collagen cDNA into an AcNPV genomic DNA with a conventional 
method and infecting a larval silkwomn with this constructed vector. The site into which the human collagen cDNA is 
incorporated by homologous recombination may be for example, within a silk protein gene whk:h may include the fibroin 
H chain, the fibroin L chain, P25 and sericin, or may be an arbitrary genomic DNA region other than the silk protein 
35 genes. 

[0034] In instances where the incorporation site is within the silk protein gene, DNA sequences which are homologous 
to two sites of arbitrary regions within the silk protein gene are ligated respectively before and behind of a human 
collagen cDNA to construct an AcNPV targeting vector. By infection of this targeting vector, a human collagen cDNA 
is incorporated downstream of an endogenous expression regulatory sequence of a silkwonm leading to expression of 

40 human collagen from the cDNA through the action of endogenous silk protein promoter/ enhancer. Further, in this case, 
a human collagen cDNA can be also incorporated into a silk protein gene so that a fusion protein of a human collagen 
with a silk protein can be synthesized. For example, a fusion protein of a human collagen with a fibroin L chain can be 
synthesized through incorporating a human collagen cDNA immediately before a termination codon in the seventh 
exon of the fibroin L chain gene, such that an amino acid frame is serially formed. Into the AcNPV vector in this instance, 

45 are inserted DNA sequences of approximately 0.5 kb to 6.0 kb respectively, which are homologous to genomic DNAs 
of upstream and downstream from the seventh exon of the fibroin L chain, before and behind of the collagen cDNA. 
[0035] In addition, when a human collagen cDNA is subjected to homologous recombination to an arbitrary genomic 
DNA sequence other than the silk protein genes, a human collagen expression cassette is produced with a silk protein 
gene promoter ligated upstream of a human collagen cDNA. Then DNA sequences, which are homologous to approx- 

50 imately 0.5 kb to 6.0 kb respectively of upstream and downstream of the silkworm genomic DNA region to be subjected 
to homologous recombination of the expression cassette, are ligated before and behind of the expression cassette, 
and the ligated product is incorporated into the AcNPV vector to construct a targeting vector In either case where the 
homologous recombination site is in the silk protein gene, or is in an arbitrary genomb sequence other than silk protein 
genes, it is also possible that a marker gene is simultaneously incorporated together with a human collagen cDNA, 

55 and selection of a transformed silkwonn in the next generation (F1) and the generation after next (F2) is facilitated. 
Examples of the marker gene include for example, fluorescent protein genes such as GFP. Further, examples of the 
promoter to permit the expression of the marker gene Include for example, silkworm actin promoter and Drosophila 
HSP70 promoter. 
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[0036] Other embodiment for generating a transformed silkworm having a human collagen cDNA incorporated therein 
concerns transformation in which DNA transposon derived from an insect is utilized. DNA transposon derived from an 
Insect such as piggyBac, mar/ner (Insect Mol. Biol. 9: 145-155, 2000) or Minos (Insect MoL Biol. 9: 277-281 , 2000) Is 
known to exhibit a transfer activity In silkworm cells, and thus transfomnation of a silkwonn is enabled by a plasmid 

5 vector produced on the basis of such DNA transposon. In particular, transformation of a silkworm was successfully 
effected in fact by microinjection of a plasmid vector produced on the basis of piggyBac into a silkworm egg (Nat. 
Biotech nol. 1 8: 81 -84, 2000). When DNA transposon derived from an insect is utilized, the foreign gene Is incorporated 
Into an arbitrary sequence In a silkworm genome. Therefore, for pemriittlng the expression In a silk gland of the human 
collagen cDNA which was incorporated into a silkworm genomic sequence, to produce a human collagen expression 

10 cassette is required which was previously ligated upstream of the human collagen cDNA, with promoter and enhancer 
derived from silk protein genes which may include the fibroin H chain, the fibroin L chain, P25 and serlcln. 
[0037] Moreover, when a fusion polynucleotide encoding human collagen with a silk protein Is incorporated, an ex- 
pression cassette is produced with a promoter and enhancer derived from a silk protein gene ligated upstream of this 
fusion polynucleotide. Taking piggyBac as an example, characteristics and process for introducing a human collagen 

15 expression cassette are explained below, however, the DNA transposon derived from an insect which may be used in 
the invention is not limited to piggyBac, but other DNA transposon may be used which includes mariner an6 Minos. 
Process for introducing a human collagen expression cassette when such transposon other than piggyBac Is used Is 
essentially similar to the process as described below In which piggyBac is used. 

[0038] The piggyBac Is DNA transposon Isolated from TN-368 which is a cultured cell derived from a Lepidoptera 

20 insect, Trichoptusia ni. It is composed of transposase ORF which is located in the middle part thereof and Inverted 
terminal repeats of 13 bp which are positioned on both ends, and has the length of about 2.5 kb. A transposase protein 
is synthesized from the transposase ORF. By the action of a transposase, the region sandwiched between the inverted 
temninal repeats {piggyBac Itself) Is transferred to a targeted sequence, TTAA (Virology 1 61 : 8-1 7, 1 989). For insertion 
of a human collagen expression cassette into a silkworm genomic sequence utilizing such property of piggyBac, similar 

25 process to the process of for example, Tamura et al. (Nat. Blotechnol. 18: 81-84, 2000), may be carried out. More 
specifically, a pair of the inverted terminal repeats of piggyBac is incorporated into an appropriate plasmid vector, 
thereby inserting a human collagen expression cassette such that it is sandwiched between the pair of the inverted 
terminal repeats. Then this plasmid vector Is mtcrolnjected Into a silkworm egg together with a transposase expression 
vector (a helper plasmid) of piggyBac, This helper plasmid is a recombinant plasmid vector with deletion of one or both 

so of the Inverted terminal repeats of piggyBac, and substantially having transposase gene region of piggyBac alone 
incorporated therein. In this helper plasmid, a promoter which may be utilized for allowing expression of transposase 
may be an authentic transposon promoter as it Is, or may be a silkwonm actin promoter or Drosophiia HSP70 promoter 
or the like. In order to facilitate the screening of silkworms in the next generation, a marker gene may be concomitantly 
incorporated into the vector having the collagen expression cassette Incorporated. In this instance, a promoter se- 

35 quence such as e.g., a silkworm actin promoter, Drosophiia HSP70 promoter or the like Is Incorporated upstream of 
the marker gene, and the expression of the marker gene is activated on behalf of inserted promoter. 
[0039] Selection of a transfomned silkworm from silkworms In the F1 generation, and from silkwomris in additional 
F2 generation in case where the AcNPV vector is used, may be performed using for example, a PGR method or a 
Southern blot method. In addition, when a marker gene was incorporated, selection using its phenotypk: character is 

40 also possible. For example, when a fluorescent protein gene such as GFP was utilized as a marker gene, an excitation 
light is irradiated on eggs or larvae of silkworms in the F1 or F2 generation, and selection can be perfected by detecting 
fluorescence emitted from the fluorescent protein. The silkworm selected in such a manner is a transformed silkworm 
having the human collagen cDNA incorporated in its chromosome. Therefore, the human collagen cDNA is transmitted 
without disappearance also in offspring generated by mating of these silkworms with wild type silkwonns, or of trans- 

45 formed silkwonms with each other. Thus, production of human collagen or a fusion protein of collagen with a silk protein 
is penmitted throughout the generations. 

[0040] When a polynucleotide encoding prolyl hydroxylase is introduced.to the transformed silkwonm which produces 
human collagen as described above in a mode to enable the expression in a silk gland, completely heat stable human 
collagen can be produced at a physiological temperature. The prolyl hydroxylase polynucleotide to be introduced (e. 

50 g., cDNA) may be a polynucleotide derived from a human or any other animal, or may be a prolyl hydroxylase polynu- 
cleotide derived from a silkwonm. The prolyl hydroxylase is a complex of the a subunit and the p subunit. Unless this 
complex is fonmed, no enzymatic activity is caused. The a subunit is a subunit having an enzymatic activity, and orig- 
inally exists in a cell which produces collagen. To the contrary, the p subunit is a polypeptide which is identical to protein 
disulfide isomerase which is an enzyme that catalyzes structural conversion of a disulfide bond of a protein, and uni- 

55 versaDy exists in all cells in a relatively large amount Since a sllkwomn silk gland cell does not produce a targe amount 
of collagen, only a small amount of the a subunit is present, but a relatively large amount of ^ subunit is present therein. 
For introducing a prolyl hydroxylase polynucleotide to a silkwonm and allowing expression of prolyl hydroxylase In a 
silkwonm sitk gland, a process in whrch a prolyl hydroxylase polynucleotide derived from an animal such as a human 
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as described above Is used, or a process In which a prolyl hydroxylase polynucleotide of a silkworm Is used may be 
employed. When a prolyl hydroxylase polynucleotide derived from an animal such as a human is used, two kinds of 
polynucleotides respectively encoding the a subunit and the p subunit are required. Because the p subunit of an Insect 
scarcely fonns an active complex with the a subunit derived from an animal such as a human, an active enzyme can 
5 not be synthesized with merely expression of the a subunrt alone derived from an animal such as a human. To the 
contrary, when a prolyl hydroxylase polynucleotide of a silkworm Is used, use of the a subunit alone may be acceptable, 
because the a subunit which is expressed in a silk gland can form an active complex with endogenous ^ subunit that 
exists in a comparatively large amount. 

[0041] This application provides a novel polynucleotide encoding a silkworm prolyl hydroxylase o subunit (SEQ ID 
10 NO: 2). This polynucleotide is a cDNA having a base sequence set out in SEQ ID NO: 1 , and a DNA fragment Isolated 
and purified from a genomic DNA and an RN A fragment. A genomic DNA fragment or an RNA fragment can be obtained 
by screening or PGR in which an oligonucleotide or the like is used which is produced on the basis of the base sequence 
set out in SEQ ID NO: 1 . 

[0042] In order to allow the expression of prolyl hydroxylase in a silkworm silk gland through introducing a poiynu- 

15 cleotide encoding prolyl hydroxylase into a silkworm chromosome, a promoter capable of allowing gene expression in 
the silk gland Is utilized. Examples of the promoter which satisfy this requirement include promoters of silk proteins 
that may include fibroin, P25 and serlcin which are capable of allowing expression of a gene in only a silk gland, and 
promoters such as silkworm actin, Drosophila HSP70 and 1E1 of AcNPV which allow expression in any tissue. For 
introducing a prolyl hydroxylase polynucleotide into a silkworm chromosome, similar methods as one for introducing 

20 a human collagen cDNA as described above may be carried out, i.e. , a gene targeting method in which AcNPV is used, 
or a method In which a plasmid vector produced on the basis of DNA transposon such as piggyBac Is microinjected 
into a silkworm egg. For generating a silkworm having both polynucleotides of human collagen and prolyl hydroxylase, 
a prolyl hydroxylase polynucleotide may be introduced into a transformed silkworm having a human collagen polynu- 
cleotide, or on the contrary, a human collagen polynucleotide may be introduced into a transformed silkwomn having 

25 a prolyl hydroxylase polynucleotide. Alternatively, a transformed silkwomri having a human collagen polynucleotide 
alone, and a transfomned silkworm having a prolyl hydroxylase polynucleotide are separately generated followed by 
mating therebetween, and then any transformed silkworm having both polynucleotides may be selected. Both polynu- 
cleotides can be introduced respectively with an AcNPV vector, or both polynucleotides can be introduced respectively 
with a DNA transposon vector, however, a human collagen polynucleotide may be introduced using an AcNPV targeting 

30 vector whilst a prolyl hydroxylase polynucleotide may be introduced using a DNA transposon plasmid vector. Altema- 
tively, the reverse fashion thereof may be pennissible. 

[0043] The transformed sllkwomn having a human collagen cDNA synthesizes human collagen together with endog- 
enous silk proteins when it reaches to the.f ifth-instar stage, and secretes human collagen In its cocoon as a part of the 
silk thread. Human collagen in the cocoon can be readily extracted with for example, 0.5 M acetk: acid or the like. In 

35 addition, when human collagen was synthesized as a fusion protein with a silkwonm fibroin L chain, extraction can be 
executed by putting It into a reducing state to cleave a disulfide bond between the fusion protein and a fibroin H chain. 
Further, when human collagen included in the cocoon is fibrous collagen and purification of only the triple helical region 
(atelocoltagen) Is Intended, proteins in the cocoon are treated with a proteolytic enzyme such as pepsin. According to 
this manipulation, atelocollagen which is not digested with a proteolytic enzyme can be extracted, and many other 

40 proteins are subjected to digestion with pepsin. Therefore, the following purification can be readily performed. Moreover, 
human collagen can be also extracted and purified from the silk gland.of the transformed silkwonm in a similar manner 
to the instance of the cocoon. Because the silk gland can be easily separated by dissection of the silkworm, and almost 
of the proteins included therein are silk proteins, human collagen can be readily extracted and purified similariy to the 
case of the cocoon. 

45 

Examples 

[0044] The present Invention Is more specifically explained with reference to Examples relating to processes for 
producing human type III collagen, however, the present invention should not be construed as being limited to these 
50 Examples. 

Example 1 

Productton of Transformed Silkworm by Gene Targeting Method Using AcNPV Vector 

55 

[0045] As cDNA encoding human type III procollagen, cDNA clone which previously obtained by the inventors of this 
application (JP-A-8>23979; GeneBank database Accession No. X14420) was used. A silkwomn fibroin L chain gene 
and a silkwomri genomic DNA sequence downstream of the same were isolated by the following method using base 
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sequence of known silkwonn fibroin L chain gene (Gene 100: 151-158, 1992; GeneBank database Accession No. 
M 76430). 

[0046] In the following description, base numbers In a human type HI procollagen cDNA and a silkwonn fibroin L 
chain gene are In accordance with the base numbers described In the aforementioned GeneBank database. 

5 

(1) Isolation of Silkworm Fibroin L Chain Gene and Genomic DNA Sequence in Downstream Region of the Gene 

[0047] Amplification of a gene fragment In the Intron 6 of the fibroin L chain was perfonned using a PCR method. 
Oligonucleotides synthesized on the basis of the sequence of the aforementioned database (SEQ ID NO: 3 and SEQ 
10 ID NO: 4) were used as PCR primers, and a genomic DNA of a silkworm tokai x asahi strain was used as a template. 
Next, screening for a silkworm kinshu x showa strain genomic library which was constructed by XEMBL3 was performed 
using thus resulting DNA fragment as a probe according to a conventional method. As a consequence, a silkworm 
genomic DNA fragment (pRI/10k) was obtained which contains a region spanning about 15 kb downstream from the 
base number 9600 of the fibroin L chain gene. 

15 

(2) Production of Restriction Enzyme Recognition Sequence by Site>Directed Mutagenesis 

[0048] in order to llgate the type III procollagen cDNA into the exon 7 of a fibroin L chain gene, a new restriction 
enzyme site was provided to a fibroin gene by site-directed mutagenesis. For the mutagenesis, Mutagenesis Kit of 

^ Clontech was used. A primer for the mutagenesis (SEQ ID NO: 5) was designed such that a restriction enzyme Xhol 
site (positions 8-12 in SEQ ID NO: 5) Is provided immediately before the stop codon of the fibroin L chain gene exon 
7. As a template plasmid, a plasmid obtained by subcloning of an EcoRI-SphI fragment from pRl/10k (base number: 
12000-14800) into pUC18 (pRISphl/2.9k) was used (Mut pRISphl-Xhol). In a similar manner, an Xbal site was newly 
provided by mutagenesis for Inserting a fibroin L chain gene downstream fragment (base number: 14200-18900). A 

25 primer for the mutagenesis (SEQ ID NO: 6) was designed such that an Xbal site (positions 12-1 7 In SEQ ID NO: 6) Is 
provided Immediately before the stop codon of the fibroin L chain gene exon 7. As a template plasmid, pRISphl/2.9k 
which Is similar to one described above was used (Mut pRISphl-Xhol). 

[0049] A restriction enzyme Xhol site was introduced also Into a base sequence encoding an aminopropeptide of a 
type III procollagen cDNA using a similar site-directed mutagenesis method as described above. Base sequence of 
30 the primer for the mutagenesis (SEQ ID NO: 7) corresponded to the base number of from 292 to 324 of the type III 
procollagen cDNA, which has the restriction enzyme Xhol site Inserted therein (positions 17-22 in SEQ ID NO: 7). 

(3) Construction of Transfer Vector 

35 [0050] A baculovirus transfer vector pBacPAK9 of Clontech was digested with restriction enzymes Smal and EcoRI, 
and after ligation with a DNA fragment (EcoRV-EcoRI) containing Drosophita HSP70 promoter excised from pCaSepR- 
hsp (GeneBank Accession No. U59056) and an EGFP cDNA (Clontech) Hgated thereto, the product was transformed 
to Escherichia coll DH5 a (pBacPAKhsEGFP). Ligation was conducted using Ligation Kit Ver. 1 (TaKaRa), and trans- 
formation and the like were carried out according to the conventional method. 

40 [0051 ] A fragment of the insert sequence (EcoRV-Xhol) was excised from previously completed mutant plasmid Mut 
pRISphl-Xhol, and was Hgated to the Xhol site of type III procollagen cDN A which had been constructed by mutagenesis. 
Subsequently, an SV40 polyadenylatlon signal sequence whk:h was amplified from pCEP4 (Invitrogen) by PCR was 
inserted downstream of the collagen cDNA (Bgll site). Then, thus completed fibroin-collagen-polyadenytation signal 
sequence fragment (EcoRV-Bgll) was Hgated to EcoRV-Bgll sites of pBacPAKhsEGFP. Next, from a part of the fibroin 

45 L chain gene Intron 2 to a part of the exon 7 (base number: about 10000-13100) was inserted in the EcoRV site 
(pBacPAKhsEGFP-FibI). 

[0052] Subsequently, a fragment EcoRV-SphI of 1 .7 kbp of the Insert sequence (base number: about 1 31 00-1 4800) 
was excised from the mutant plasmid Mut pRISphl-Xbal, and Inserted to Smal-SphI sites of pRI/IOk (by partial diges- 
tion). An Xbal digestion fragment excised from thus completed MutRI/1 Ok (a part of the fibroin gene exon 7 and down- 
50 Stream region thereof/ base number: about 14200-1 8900) was inserted to pBacPAKhsEGFP-FibI (pMOSRA-1 : Fig. 1). 

(4) Production of Recombinant Virus 

[0053] A recombinant virus was produced by cotransfection of thus produced targeting vector pMOSRA-1 and bac- 
55 ulovirus DNA into Insect cultured cell Sf9. After adding 4 \i\ of llpofectin (Gibco) and 4 |il of water to 7 \i\ of 0.4 ^g/jxl 
pMOSRA-1 and 1 ^1 of 0.1 \ig/[i\ linear baculovims DNA (Phanningen) and mixing welt, the mixture was dropped onto 
1 X 10^ cells of Sf9 cell which had been replaced with a serum free medium SF900-II (Gibco), and cultured. After 24 
hrs passed, 1 ml of a Grace's medium supplemented with 10% serum was added thereto followed by culture for addi- 
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tional 3 days. Then the culture supernatant was recovered. 

(5) Screening and Refinement of Reconribinant Virus 

5 [0054] To 1 X 1 0^ cells of Sf9 cell was added 1 GO |il of the aforementioned viral liquid to allow infection at 2S°C for 
1 hour. The supernatant was removed, and thereto was added 1 ml of a Grace's medium containing 1% low melting 
agarose solution (SEAPLAQUE, FMC) to penmit hardening. After culture for 3 days, an excitation light having the 
wavelength of 360 nm was irradiated on the resulting plaques. Ten plaques generating green fluorescence, i.e., plaques 
of cells which express EGFP were excised on the block with the agar Next, recombinant viruses recovered from these 

10 plaques were again infected to 1 x 1 0^ cells of Sf9 cell, and were subjected to extraction of intracellular DNA, followed 
by dot blotting of the extracted DNA. As the probe, two kinds of probes, i.e., a probe which recognizes the fibroin L 
chain gene and a probe which recognizes a procollagen cDNA were employed. Consequently, viral clones derived 
from 4 plaques were revealed to be positive. Then these positive viruses were infected to 1 x 1 0^ cells of Sf9 cell, and 
the culture supernatant was recovered after three days passed. The viruses in the culture supernatant were again 

15 infected similarly to the Sf9 cells to obtain a viral stock having a high titer. 

(6) Infection of Recombinant Baculovirus to Larval Sllkwonm of the Fifth-instar Stage 

[0055] A female larval silkworm on the first day of the fifth-instar stage was subcutaneousty injected 50 {xl of the viral 
20 liquid (5 x 10^ pfu). On the 3rd and 4th days after pupation of the inoculated larva, 10 p.1 of 20-hydroxyecdysone 
dissolved in methanol (2 mg/mt) was administered. After the ecloslon, mating was conducted with a male normal 
silkwomn moth and then oviposition was pemnitted. 

(7) Screening of F1 Silkworm 

25 

[0056] F1 eggs (1 00 to 300 eggs) laid by one female silkworm were classified into 1 group, and about 50 eggs from 
each group were sampled while the remaining eggs of respective groupsleft to be kept. DNA was extracted from the 
sampled eggs by a conventional method, and subjected to detemriination of whether or not any foreign gene was 
present by a PCR method. PGR was perfonmed such that the SV40 polyadenylation signal was detected using a primer 

30 set (SEQ ID NO: 8 and 9), or such that the HSP70 promoter was detected using a primer set (SEQ ID NO: 10 and 11). 
[0057] As a result of screening of eggs from 1800 groups in total, 15 groups were positive. Remaining eggs in the 
positive groups were hatched, and thus hatched F1 larvae were kept until the fifth-instar stage. On the third day of the 
fifth-instar stage, about 100 ^1 of a body fluid was collected from each larva. From these body fluids were separated 
body fluid cells by centrif ugatlon. After extracting the DNA from these cells, screening by a PCR method was conducted. 

35 As a consequence of screening for total 3400 larvae of the fifth-instar stage, 8 larvae were positive. Keeping of these 
silkwomris was continued, and the eclosed silkworm moth was mated with a nomrial silkworm moth to pemnit oviposition. 

(8) Screening of F2 Silkworm 

40 [0058] Screening of F2 silkworms was carried out with green fluorescence. An excitation light having the wavelength 
of 360 nm was irradiated on the first-instar larvae, and selection was executed for individuals generating green fluo- 
rescence of EGFP, i.e., individuals who express EGFP which was inserted as a marker gene. As a result of screening 
of 8 groups of F2 eggs, two positive silkworms could be obtained. 

(9) Detection of Recombinant Human Collagen 

[0059] Positive F2 silkwonms were kept until the spinning stage, and the proteins in the cocoon were extracted by 
adding thereto an SDS-sample buffer (0. 1 25 M Tris-HCI buffer, pH 6.8/4% SDS/ 1 0% 2-mercaptoethanol/ 20% glycerol) 
and mixing followed by a heat treatment at 100°C for 5 min. This sample was applied to SDS polyacrytamide gel 

50 electrophoresis (Nature 227: 680-685, 1970), and the electrophoresed proteins were transferred to a nitrocellulose 
membrane BA85 (S&S Corporation) according to the method of Matudaira et at. (J. Biol. Chem. 261: 10035-10038, 
1987). Next, the nitrocellulose membrane with the transferred proteins was subjected to a treatment in a blocking 
solution (3% BSA/ 50 mM Tris-HC) buffer, pH 7.5/150 mM NaCI) at 4°C tor 16 hrs. Thereafter, a reaction was allowed 
with a 200 x diluted anti-human/ bovine type III collagen antibody in the blocking solution at room temperature for 1 

S5 hour. Proteins which react with these antibodies were detected with VECTASTAIN ABC kit (Vector Laboratories Com- 
pany). Consequently, human type HI collagen was detected from the cocoon. 
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Example 2 

Production of Transformed Silkworm by Microinjection of piggy Bac Vector Into Egg 

5 (1 ) Production of piggy Bac PI asm id Vector 

[0060] After digesting a human type III procollagen cDNA (base number: 92-4550; GeneBank database Accession 
No. X14420) with Xhol to remove the region conresponding to the base number of 1075-3545, a mini type til collagen 
cDNA was produced through ligating the truncated ends by self ligation. This mini collagen cDNA was constituted from 
10 base sequences coding for an aminopropeptide, a part of a triple helical region (about one fifth of the entire collagen 
triple helical region) and a carboxyl propeptide, and the following three kinds of vectors were constructed on the basis 
thereof. 

i) PM0SRA-4B 

15 

[0061] The insert DNA fragments included in this vector are constituted from a silkworm sericin prompter, a mini 
collagen cDNA, and a silkworm fibroin L chain polyadenylation signal. The silkwonn sericin promoter (base number: 
1 299-1 622; GeneBank database Accession No. AB007831 ) was isolated by PCR using a primer having an added Bglll 
recognition sequence at its 5' end (SEQ ID NO: 12) and a primer (SEQ ID NO: 13) with a silkwomn genomic DNA as 

20 a template. The isolated promoter was ligated upstream of the mini collagen cDNA. Furthemnore, a silkworm fibroin L 
chain polyadenylation signal (base number: 14141-14624; GeneBank database Accession No. M76430) was isolated 
by PCR using a primer (SEQ ID NO: 14) and a primer having an added Bglll recognition sequence at its 5' end (SEQ 
ID NO: 1 5) with a silkworm genomic DNA as a template. The isolated signal was ligated downstream of the mini collagen 
cDNA (Fig. 2). After digesting both ends.of thus resulting insert DNA fragment with Bglll, protruding ends thereof was 

25 blunted by treatment with T4DNA polymerase. Thereafter, this blunt-ended fragment was inserted into a pPIGA3GFP 
vector, which had been digested with Xhol and then blunted (Nat. Biotechnol. 18: 81-84, 2000). An EGFP cDNA as a 
marker and a silkwonn actin (A3) promoter for allowing the expression of this cDNA are incorporated in the pPIGA3GFP 
vector. 

30 ii) pMOSRA-5 

[0062] The insert DNA fragments included in this vector are constituted from a silkworm fibroin L chain promoter, a 
silkwomn fibroin L chain signal peptide cDNA, a mini collagen cDNA, and a silkworm fibroin L chain polyadenylation 
signal. The silkworm fibroin L chain signal peptide cDNA (base number: 28-160; GeneBank database Accession No. 

35 XI 7291 ) was isolated by PCR using a primer (SEQ ID NO: 1 6) and a primer having an Incorporated Xhol recognition 
sequence at its 5' end (SEQ ID NO: 17) with a cDNA derived from a silkwomn silk gland as a template. In addition, in 
a similar manner to that described in Example 1 (2), an Xhol recognition sequence was introduced into encoding region 
of procollagen aminopropeptide of a mini collagen cDNA, and the region from the 5' end to Xhol cleavage site of this 
modified cDNA was substituted for isolated fibroin L chain signal peptide cDNA. Furthemnore, the silkworm fibroin L 

40 chain promoter (base number: 428-1061 ; GeneBank database Accession No. M76430) was isolated by PCR using a 
primer having an added Xbal and Bglll recognition sequences at its 5' end (SEQ ID NO: 18) and a primer (SEQ ID 
NO: 19) with a silkworm genomb DNA as a template. Thereafter the isolated promoter was ligated upstream of the 
silkwomn fibroin L chain signal peptide cDNA, and the silkworm fibroin L chain polyadenylation signal was ligated 
downstream of the mini collagen cDNA (Ftg. 2). Both ends of thus resulting insert DNA fragment were digested with 

45 Bglll. and the protruding ends were blunted by T4 DNA polymerase. Thereafter, it was Inserted into the pPIGA3GFP 
vector, which had been digested with Xhol and then blunt ended. 

iii) pMOSRA-6 

50 [0063] The insert DNA fragments included in this vector are constituted from a silkworm fibroin L chain promoter, a 
silkwonm fibroin L chain full length cDNA, a mini collagen cDNA, and a silkworm fibroin L chain polyadenylation signal. 
The cDNA covering full length of silkworm fibroin L chain (base number: 30-820; GeneBank database Accession No. 
XI 7291 ) was isolated by PCR using a primer (SEQ ID NO: 20) and a primer having an Incorporated Xhol recognition 
sequence at its 5' end (SEQ ID NO: 21 ) with a cDNA derived from a silkworm silk gland as a template. Next, in a similar 

55 method to that of producing pMOSRA-5, the aminopropeptide-encoding region from 5' end to the modified Xhol cleav- 
age site in mini collagen cDNA was substituted for the isolated cDNA. In addition, the silkworm fibroin L chain promoter 
was ligated upstream of the silkworm fibroin L chain cDNA, and the silkworm fibroin L chain polyadenylation signal 
was ligated downstream of the mini collagen cDNA (Fig. 2). Both ends of thus resulting Insert DNA fragment were 
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digested with Xbal, and the protruding ends were additionally blunted with T4 DNA polymerase. Thereafter, it was 
Inserted into the pPIGA3GFP vector, which had been digested by Xhol and then blunted. 

(2) Microinjection of Plasnnid Vector into Silkworm Egg 

5 

[0064] After purifying the aforementioned three kinds of mini collagen vectors (pMOSRA-4B, pMOSRA-5, pMOSRA> 
6) by a cesium chloride ultracentrifuge method, these three kinds of vectors and pHA3PIG which Is a helper plasmid 
(Nat. Biotechnol. 18: 81-84, 2000) were admixed. After being subjected to ethanol precipitation, the mixture was dis- 
solved in an injection buffer (0.5 mlVI phosphate buffer, pH 7.0, 5 mM KCi) such that each concentration of the mini 
to collagen vector became 66.7 jig/mi (200 p.g/ml in total) and the concentration of pHA3PIG became 200 fig/ml. This 
DNA solution was microinjected into silkworm eggs in a preblastoderm stage of 2 to 8 hrs post oviposltlon (silkworm 
embryos), in a fluid volume of about 15 to 20 nl per one egg. Microinjection was carried out to 1078 eggs In total. 

(3) Screening of F1 Larvae 

15 

[0065] When eggs having the microinjected vector DNA were incubated at 25*'C, 518 eggs were hatched. The 
hatched eggs were subsequently kept, and 413 reproductive adults were obtained, which were mated to give 213 
groups of F1 egg batches. Next, these F1 eggs were hatched, and the hatched larvae were observed by a fluorescence 
microscope for each group. As a consequence, larvae generating green fluorescence were obtained from 26 groups. 

20 Number of positive silkworms included in the positive groups was 1 to 30 per one group, and summation thereof was 
240. These positive F1 silkworms were kept and mated with a wild type silkworm to obtain silkwonns of the additional 
next generation (F2). Among the F2 silkworms, it was ascertained that about half silkwomns generated green fluores- 
cence, and that the Incorporated EFGP and mini collagen genes were transmitted to the next generation without drop 
out, according to Mendellan rule. Moreover, when a genomic DNA was extracted from F1 sllkwonn adults after the 

25 oviposltlon followed by amplification of a mini collagen DNA fragment by PGR using a primer set (SEQ ID NOs: 22 and 
23) and of an EGFP fragment by PGR using a primer set (SEQ ID NOs: 24 and 25), the mini collagen and EGFP DNA 
fragments could be detected from the positive silkwonns which generate, green fluorescence (Fig. 3). Additionally, 
Southern blot analysis was performed to conf irnri that these exogenous genes were incorporated Into the silkworm 
genomic DNA. 

30 

(4) Detection of Mini Gollagen mRNA and Recombinant Mini Gollagen 

[0066] A positive F1 larva reached to the fifth-instar stage was submitted to dissection to remove a silk gland, and 
total RNA was extracted by an acid guanidine phenol chlorofomri method. PGR was conducted using a cDNA as a 

35 template, which was obtained by reverse-transcription of the RNA in an amount of 200 ng from the extracted total RNA, 
with 30 cycles under the reaction condition of: 94'C for 1 minute, 60°C for 1 minute and 72*'C for 1 minute (RT-PGR). 
As a primer, the aforementioned primer set (SEQ ID NOs: 22 and 23) was used. Gonsequently, the mini collagen mRNA 
could be detected, and it was ascertained that the mini collagen gene Incorporated, into the silkworm genomic DNA 
was expressed in silk gland cells (Fig. 4). 

40 [0067] Next, detection of the recombinant mini collagen protein was attempted. Positive F1 silkworms were kept until 
the spinning stage, and the proteins in the discharged cocoon were extracted by adding thereto an SDS-sample buffer 
(0.125 M Tris-HCI buffer, pH 6.8/ 4% SDS/ 10% 2-mercaptoethanol/ 20% glycerol) and mixing followed by a heat 
treatment at 100°G for 5 min. This extracted sample was applied to SDS polyacrylamide gel electrophoresis (Nature 
227: 680-685, 1970), and the electrophoresed proteins were transferred to a nitrocellulose membrane BA85 (S&S 

45 Gorporation) according to the method of Matudaira et al. (J. Biol. Chem. 261 : 1 0035-1 0038, 1 987). Next, the nitrocel- 
lulose membrane with the transferred proteins was subjected to a treatment in a blocking solution (3% BSA/ 50 mM 
Tris-HGI buffer, pH 7.5/ 150 mM NaGI) at A'^C for 16 hrs. Thereafter, a reaction was allowed with a 200 x diluted anti- 
human/ bovine type III collagen antibody in the blocking solution at room temperature for 1 hour. Proteins which react 
with these antibodies were detected with VEGTASTAIN ABG kit (Vector Laboratories Gompany). Gonsequently, the 

50 recombinant mini collagen protein could be detected from the cocoon. 

Example 3 

Cloning of Silkworm Prolyl hydroxylase a Subunit cDNA 

55 

[0068] First, a partial sequence of silkworm prolyl hydroxylase a subunit cDNA was cloned by a degenerate PGR 
method using mixed primers. By comparison to amino acid sequences deduced from each reported prolyl hydroxylase 
a subunit gene of human, fruit fly and nematode, mixed primers P3 (SEQ ID NO: 26) and P5II (SEQ ID NO: 27) were 
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designed having base sequences which can be degenerated fronn the amino acid sequence conserved among them. 
In the mixed primer sequences, n indicates a, c, g or t; r Indicates a or g; y Indicates c or t; s Indicates c or g; and w 
Indicates a or t. Subsequently, for the purpose of preparing a template for PCR, total RNAs were extracted respectively 
from BmN4 cells which are silkwonn culture cells and larval silkworm of the second-lnstar stage. PGR was conducted 

5 using a cDNA as a template, which was obtained by reverse-transcription of the RNA in an amount of 200 ng from the 
extracted total RNA, with 40 cycles under the reaction condition of: 94°C for 1 minute, 58°C for 1 minute and 72°C for 
1 minute. As a consequence, an amplification product of about 1 50 bp was confirmed on electrophoresis for both of 
the BmN4 cells and larval silkworm of the second-lnstar stage. The amplification product was subcloned Into pCR2.1 
(Invltrogen), and the base sequence was detemrilned by a dideoxy method. Thus, it was revealed that this cDNA 

10 fragment was a partial fragment of the silkworm prolyl hydroxylase cDNA because it had high homology with prolyl 
hydroxylase a subunits of other animals in both respect of the base sequence level and the amino acid sequence level 
predicted therefrom. 

[0069] Then, for the purpose of cloning the full length cDNA, upstream and downstream of thus resulting silkworm 
prolyl hydroxylase a subunit cDNA fragment were Isolated by a RACE (Rapid Amplification cONA Ends) method. Primer 

15 GSP1 for 5' RACE (SEQ ID NO: 28) and primer GSP2 for 3' RACE (SEQ ID NO: 29) were designed on the basis of 
the base sequence of the cDNA fragment. RACE was carried out using SMART^*^ RACE cDNA Amplification Kit of 
Clontech. As a consequence of the RACE, cDNA fragments of about 1 .7 kb in the 5' region and of about 1.2 kb in the 
3' region could be ascertained on electrophoresis. These cDNA fragments were subcloned similarly as described 
above, and the base sequences were analyzed. Thus, the cDNA fragments contained the entire sequence encoding 

20 the silkworm prolyl hydroxylase a subunit. The base sequence of the resulting full length cDNA is set out in SEQ ID 
NO: 1 , and the amino acid sequence of the silkworm prolyl hydroxylase a subunit encoded by this cDNA ts set out in 
SEQ ID NO: 2. 

Industrial Applicability 

25 

[0070] As explained in detail hereinabove, according to the present invention, a transfonned silkworm which produces 
recombinant human collagen as a part of proteins Included In the cocoon or the silk gland; and recombinant human 
collagen produced by this silkworm are provided. Since recombinant human collagen is collected from the cocoon 
discharged by the transformed silkwomi or from the silk gland of the transfomned silkworm, extraction thereof is easy, 
30 and hence, collagen having high purity can be readily obtained. In addition, because the recombinant human collagen 
produced by the transfomned silkworm is safe collagen which Is free from any fear of the contamination with pathogens 
such as vimses or prions and exhibits no antlgenk:ity toward humans. Thus, it can be utilized in various industrial fields 
including medicines, foods, cosmetics and the like. 

35 
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SEQUENCE LISTING 

<110> Japan Science and Technology Corporation; 

Hiroshima industrial Technology Organization; 
TERUMO CORPORATE ON : 
TOKEN Ca . ltd. : and 

National institute of Agrobiological Sciences 

<120> A Transformed Silkworm producing Hunan Collagen 

<130> 01-F-027PCT 

<140> PCT/JP01/04906 
<141> 2001-06-11 

<150> JP2000-361563 
<151> 2000-11-28 

<160> 29 

<170> Patentin Ver. 2. 1 

<210> 1 
<2ri> 2483 
<212> ONA 
<213> Bombyx nori 

<220> 
<221> CDS 

<222> (228).. (1880) 
<400> 1 

tacgcggggg aaccgaccea cgtcgggctg cgtggacgca cstatctttg ccagattgtc 60 

gctttattta acgtttataa ttgtgaattg attgcaattc aatttgtggt gaatgaggtt 120 

aaactgcaat acgtttttct aataatcctt aatgatgtac gtttgtctgt aaaacaattt 180 

gctgatctgc aagttctaga atcgttttca tatttatttc tgcaaaa atg gaa aca: 236 

Met Glu Thr 
1 

gta aga gcg ata get ttt ttg ctt ctg ttt ttc ttt act tgg gcc aaa 284 
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Vai Arg Ala lie Ala Phe Leu Leu Leu Phe Phe Phe Thr Trp Ala Lys 
5 10 15 



10 



IS 



20 



25 



30 



35 



40 



45 



50 



55 



gca gag eta ttc aca gcc ata acg gat gtt gaa ccg eta ctg gaa acc 332 
Ala Glu Leu Phe Thr Ala lie Thr Asp Val Glu Pro Leu Leu Glu Thr 
20 25 30 35 

cac aag agg ate ata gac gat tta gat gat tae ctg caa aaa gaa gag 380 
His Lys Arg He lie Asp. Asp Leu Asp Asp Tyr Leu Gin Lys Glu Glu 

40 45 50 

agg aga ctt ttc act ttg aag aaa cac ttg aat tta tat aaa agg gaa 428 
Arg Arg Leu Phe Thr Leu Lys Lys His Leu Asn Leu Tyr Lys Arg Glu 

55 60 65 

cac gaa agg get atg gat gac ate cct aac tae etc gge aac ccg ate 476 
His Giu Arg Ala Met Asp Asp lie Pro Asn Tyr Leu Giy Asn Pro Me 
70 75 80 

aat get ttc acg ttg ata aaa aga tta acg gee gac ctt gat ttt ata 524 
Asn Ala Phe Thr Leu Me Lys Arg Leu Thr Ata Asp Leu Asp Phe Me 
85 90 . 95 

gag gat age att aaa att gga aca. gaa tac ata aag aac gtc aca atg 572 
Glu Asp Ser Me Lys Me Gly Thr Glu Tyr Me Lys Asn Val Thr Met 
TOO 105 MO - 115 

aac cac gtg gac gtg aaa tat ccg tea ttg gaa gat ctg acg gga gca 620 
Asn His Val Asp Val Lys. Tyr Pro Ser Leu Glu Asp Leu Thr Gly Ala 

120 125 130 

get eag gcg ctg act egg ttg eag gaa ace tae tat tta aat gta eae 668 
Ala Gin Ala Leu Thr Arg Leu Gin Glu Thr Tyr Tyr Leu Asn Val His 

135 140 145 

gat eta gee gag ggt ata ttg aat gga gtt tea tae agt act cct atg 716 
Asp Leu Ala Glu Gly Me Leu Asn Gly Val Ser Tyr Ser Thr Pro Met 
150 155 160 

acg gcg age gat tgt tae gaa etc ggc egg act eta tac aac gat aaa 764 
Thr Ala Ser Asp Cys Tyr Glu Leu. Gly Arg Thr Leu Tyr Asn Asp Lys 
155 170 175 
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10 



gat tac aca aac gcc ttg gcc tgg atg aaa gaa gcg eta agg aaa tat 

Asp Tyr Thr Asn Aia Leu Ala Trp Met Lys Glu Ala Leu Arg Lys Tyr 

180 185 190 195 

aaa gat gaa aac gtc atg tac ccg ttc acc gaa gtc gat ate ttg gaa 

Lys Asp Giu Asn Val Met Tyr Pro Phe Thr Glu Vai Asp lie Leu Glu 

200 205 210 



812 



860 



15 



tat ata gga ttc gca tat tat tta aac gga gac gtc aaa acc get ctg 
Tyr lie Gly Phe Ala Tyr Tyr Leu Asn Gly Asp Val Ly$ Thr Ala Leu 

215 220 225 



908 



20 



gaa tgg act cag aga cit ctg tec gtc gat ccg aag cae gta cga gcg 
Giu Trp Thr Gin Arg Leu Leu Ser Val Asp Pro Lys His Val Arg Ala 
230 235 240 



956 



25 



egg ggc aac ata ccg cae tat cag aag acg ata gcc gaa caa gag gcc 
Arg Gly Asn lie Pro His Tyr Gin Lys Thr Me Ala Glu Gin Glu Aia 
245 250 255 



1004 



30 



gaa tta aag aag caa caa cgt ggg gaa acc teg gat gag cct gaa gaa 
Glu Leu Lys Lys Gin Gin Arg Gly Giu Thr Ser Asp Glu Pro Glu Glu 
260 265 270 275 



1052 



35 



gaa gat ggt caa gat tac gaa tta tea gag tac gca aag gaa cgc aaa 
Glu Asp Gly Gin Asp Tyr Glu Leu Ser Glu Tyr Ala Lys Glu Arg Lys 

280 285 290 



1100 



40 



gtt tac gaa teg ctg tgt egg gga gaa atg gaa ata cec cat gag att 

Val Tyr Glu Ser Leu Cys Arg Gly Glu Met Glu I le Pro His Glu I te 

295 300 305 



1148 



45 



act aag agg ttg aaa tgt tgg tac gtc ace gac acg eat ccg ttt tta 
Thr Lys Arg Leu Lys Cys Trp Tyr Val Thr Asp Thr His Pro Phe Leu 
310 315 320 



1196 



50 



aag ttg gee cea. ate aaa gtg gag cag atg tac gtg aag cec gac ata 
Lys Leu Ala Pro lie Lys Val Glu Gin Met Tyr Val Lys Pro Asp Me 
325 330 335 



1244 



55 



ttt atg ttc cac gaa gtg atg acc gac gac gag att gag ttc ate aaa 
Phe Met Phe His Glu Val Met Thr Asp Asp Glu Me Glu Phe Me Lys 
340 345 350 355 



1292 



15 



f 
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aaa cga gca aag ccg agg ttc aaa egg get gtc gtt cac gac cct aaa 
Ly$ Arg Ala Lys Pro Arg Phe Ly$ Arg Ala Vai Val His A$p Pro Lys 

360 365 370 



1340 



10 



act ggt gag ctg aca ccg gcc cat tac cgc ate age aag teg teg tgg 
Thr Gly Glu Leu Thr Pro Ala His Tyr Arg lie Ser Lys Ser Ser Trp 

375 380 385 



1388 



IS 



etc Cgc gac gag gag tct ccg gtc ata geg cgc ate acg cag cgc gtc 
Leu Arg Asp Glu Giu Ser Pro Val lie Ala Arg Me Thr Gin Arg Val 
390 395 . 400 



1436 



20 



acc gac atg ace ggg etc age atg ctg cac gcc gag gag ctt eag gtc 
Thr Asp Met Thr Gly Leu Ser Met Leu His Ala Glu Glu Leu Gin Val 
405 410 415 



1484 



25 



gtc aac tac gge ata ggg gga cac tac gaa ccg cac ttc gac ttc get 
Val Asn Tyr Gly lie Gly Gly His Tyr Glu Pro His Phe Asp Phe Ala 
420 425 430 435 



1532 



30 



agg aaa egt gag aat cea ttc acg aaa ttc gge gge aac aga ata gee 
Arg Lys Arg Glu Asn Pro Phe Thr Lys Phe Gly Gly Asn Arg He Ala 

440 445 450 



1580 



35 



acc gtc etc ttc tac atg tct gac gtg geg cag gge gge get aca gtg 
Thr Val Leu: Phe Tyr Met Ser Asp Val Ala Gin Gly Gly Ala Thr Val 

455 460 465 



1628 



40 



ttc acc gaa ctt gga etc agt ttg ttt cea ata aaa ega get geg geg 
Phe Thr Glu Leu Gly Leu. Ser Leu Phe Pro He Lys Arg Ala Ala Ala 
470 475 480 



1676 



45 



ttc tgg ttg aac ctg cac geg teg gge gaa gga gac etc gee ace agg 
Phe Trp Leu Asn Leu. His Ala Ser Gly Glu Gly Asp Leu Ala Thr Arg 
485 490 495 



1724 



so 



cat gcc gee tgc ecc gtg etc agg gga tec aag tgg gtg tea aat aaa 
His Ala Ala Cys Pro Val Leu Arg Gly Ser Lys Trp Val Ser Asn Lys 
500 505 510 515 



1772 



55 



tgg ata eat eaa gge ggg eaa gag ctg ttg agg eee tgc gac ctt gag 
Trp He His Gin Gly Gly Gin Glu Leu Leu Arg Pro Cys Asp Leu Glu 



1820 



16 
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520 525 530 

tac cag gag gag ggc ate ate cgc aag att cct cgt cca gtg ccg aag 1868 
Tyr Gin Glu Glu Gly lie He Arg Lys lie Pro Arg Pro Val Pro Lys 

535 540 545 

aca tec agg tag actcegagca acgtgtgacc gaacggaact gateataaat 1920 
Thr Ser Arg 
550 

tgttcggtat agtgcatttt ttttttaaat aaaatagatt aaatattaca tagtataata 1980 
gaatgttggg tctgtgtagt acacgtattc gacttctgtg tctcgtgcga ttttacgaat 2040 
aaccacaagc ataatcgact agaatgcata atttttaatt agttatgggt caagaaagaa 2100 
gattctcatt atgaaagacg ccaagaaaat tacgccaact aacttttagg agatgtttta 2160 
aataaccaat aatttgagtg caatttatac tatactttta atatatttat taaattgttc 2220 
caatttcttt actttcccct atttagaacg tatagtttaa gttaacgttg atttgtttga 2280 
ttgtaaaata ttaacagtaa actcacacca ggcacacaca aaaaaccatc tattgttgcg 2340 
aacaaattca caaataaaca tatctaataa aaaaaaatgt taaataataa ataagttgca 2400 
atgaaacact aatatatttt. aacatattat atattatagt gatatcttag aataaaccat 2460 
cgtggetcga tggctcgtgg eta 2483 



<210> 2 
<211> 550 
<212> PRT 
<213> Bombyx mori 

<400> 2 

Met Glu Thr Val Arg Ala lie Ala Phe Leu Leu Leu Phe Phe Phe Thr 

1 5 10 15 

Trp Ala Lys Ala Giu Leu Phe Thr Ala He Thr Asp Val Glu Pro Leu 

20 25 30 

Leu Glu Thr His Lys Arg He He Asp Asp Leu Asp Asp Tyr Leu Gin 
35 40 45 



pMOSRA-4B 
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F i g. 2 



Bglll 0.00 

Hindlll 0.26 
EcoRIO.34 
PstI 0.35 

BamHIO.55 



Seridn promoter 



SacM.27 
PstI 1 .33 
Xhol 1 .36 

Pstf 1.651 



^Collagen III 



EcoRi 2.06 

Sacl 2:32 

Kpnl2.32 Bglll 2.82 




Fib-L polyA 



pMOSRA-5 



Bglll 0.00 



PstI 0.65 
Xhol 0.78 



Fib-L promoter 

Fib-L sig. pep. 



Sad 1.46 
PstI 1 .53 

Xhol 1 .55 

Pst1 1 .84 



EC0RI2.25 

Sacl 2.51 

Kpnl2.52 Bgill3.01 




A Collagen III 



Rb'L polyA 



pMOSRA-6 



Xbat0.00 



PstI 0.65 
EcoRI 0.66 
Bglll 0.87 



Seal 1.25 

Xhol 1 .46 



m^mmm^&mi tiiiiiiiHiiiiniHiiiiiiiiiiniiiiiiiiii 



Fib-L promoter Fib-L full length 



Sacl 2.14 
PstI 2.21 
Xhol 2.23 

PstI 2.52 



EC0RI2.93 

Sacl3.19 

<pnl3.20 




Bg1tl3.69 
bal3.70 



A Collagen III 



Fib-L polyA 
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Lys Glu Glu Arg Arg Leu Phe Thr Leu Lys Lys His Leu Asn Leu Tyr 

50 55 60 

Lys Arg Glu His Glu Arg Ala Met Asp Asp lie Pro Asn Tyr Leu Giy 
65 70 75 80 

Asn Pro lie Asn Ala Phe Thr Leu Me Lys Arg Leu Thr Ala Asp Leu 

85 90 95 

Asp Phe He Glu Asp Ser lie Lys Me Gly Thr Glu Tyr Me Lys Asn 

100 105 110 

Val Thr Uet Asn His Val Asp Val Lys Tyr Pro Ser Leu Glu Asp Leu 

115 120 125 

Thr Gly Ala Ala Gin Ala Leu Thr Arg Leu Gin Glu Thr Tyr Tyr Leu 

130 135 140 

Asn Val His Asp Leu. Ala Glu Gly Me Leu Asn Gly Val Ser Tyr Ser 
145 150 155 160 

Thr Pro Met Thr Ala Ser Asp Cys Tyr Glu Leu Gly Arg Thr Leu Tyr 

165 170 175 

Asn Asp Lys Asp Tyr Thr Asn Ala Leu Ala Trp Met Lys Glu Ala Leu 

180 185 190 

Arg Lys Tyr Lys Asp Glu Asn Val Met Tyr Pro Phe Thr Glu Val Asp 

195 200 205 

He Leu Glu Tyr Me Gly Phe Ala Tyr Tyr Leu Asn Gly Asp Val Lys 

210 215 220 

Thr Ala Leu Glu Trp Thr Gin Arg Leu Leu Ser Val Asp Pro Lys His 
225 230 235 240 

Val Arg Ala Arg Gly Asn Me Pro His Tyr Gin Lys Thr Me Ala Glu 

245 250 255 

Gin Glu Ala Glu Leu Lys Lys Gin Gin Arg Gly Glu Thr Ser Asp Glu 

260 265 270 

Pro Glu Glu Glu Asp Gly Gin Asp Tyr Glu Leu. Ser Glu Tyr Ala Lys 

275 280 285 

Glu Arg Lys Val Tyr Glu Ser Leu Cys Arg: Gly Glu Met Glu He Pro 

290 295 300 

His Glu He Thr Lys Arg Leu Lys Cys Trp Tyr Val Thr Asp Thr His 
305 310 315 320 

Pro Phe Leu Lys Leu Ala Pro He Lys Val Glu Gin Met Tyr Val Lys 

325 330 335 

Pro Asp He Phe Met Phe His Glu. Val Met Thr Asp Asp Glu He Glu 

340 345 350 

Phe He Lys Lys Arg Ala Lys Pro Arg Phe Lys Arg Ala Val Val His 

355 360 365 

Asp Pro Lys Thr Gly Glu Leu Thr Pro Ala His Tyr Arg He Ser Lys 

370 375 380 

Ser Ser Trp Leu Arg Asp Glu Glu Ser Pro Val He Ala Arg He Thr 
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385 390 395 400 

Gin Arg Val Thr Asp Met Thr Gly Leu Ser Met Leu His Ala Glu Glu 

405 410 415 

Leu Gin Val Val Asn Tyr Gly lie Gly Giy His Tyr Glu Pro His Phe 

420 425 430 

Asp Phe Ala Arg Lys Arg Glu Asn Pro Phe Thr Lys Phe Gly Gly Asn 

435 440 . 445 

Arg Me Ala Thr Val Leu Phe Tyr Met Ser Asp Vat Ala Gin Gly Gly 

450 455 460 

Ala Thr Val Phe Thr Glu Leu Giy Leu Ser Leu Phe Pro Me Lys Arg 
465 470 475 480 

Ala Ala Ala Phe Trp Leu Asn Leu. His Ala Ser Gly Glu Gly Asp Leu 

485 490 495 

Ala Thr Arg H\s Ala Ala Cys Pro Val Leu Arg Gly Ser Lys Trp Val 

500 505 510 

Ser Asn Lys Trp Me His Gin Gly Gly Gin Glu Leu Leu Arg Pro Cys 

515 520 525 

Asp Leu Glu Tyr Gin Glu Glu Gly lie lie Arg Lys Me Pro Arg Pro 

530 535 540 

Val Pro Lys Thr Ser Arg 
545 550 



<210> 3 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 
<400> 3 

tcgtaactgc ctacacgttt gc 22 



<210> 4 
<211> 20 
<212> DMA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 
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<400> 4 

agacgtgaac ctggctggct 20 



<210> 5 
<211> 15 
<212> ONA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 
<400> 5 

tttagactcg agcct 15 



<210> 6 
<211> 25 
<212> ONA 

<213> Artificial sequence 
<220> 

<2]3> Description of Artificial Sequence: Synthesized oligonucleotide 
<400> 6 

tacagttctt atctagacgt gaacc 25 



<210> 7 
<21t> 33 
<212> DNA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 
<400> 7 

gacataatat gtgacgctcg agaattagac tgc 33 



<210> 8 
<2n> 22 
<212> DNA 
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<213> Artificial sequence 



<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 
<400> 8 

ggatccagac atgataagat ac 22 



<210> 9 
<211> 21 
<212> ONA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 
<400> 9 

gatcataatc agccatacca c 21 



<210> 10 
<2n> 22 
<212> ONA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 
<400> 10 

gatcccccta. gaatcccaaa ac 22 

<210> 11 
<2.11> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 
<400> 11 

cttagcgacg tgttcacttt gc 22 
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<210> 12 

<2n> 27 
<212> DNA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 

<400> 12 

agatctagtc gaatttcgac tctgcg 27 



<210> 13 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 
<400> 13 

cccgatgata agacgactat g 21 



<210> 14 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 
<400> 14 

ccaggttcac gtctaaataa. g 21 



<210> 15 
<211> 27 
<212> DNA 

<213> Artificial sequence 
<220> 
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<213> Description of Artificial Sequence: Synthesized oligonucleotide 
<400> 15 

agatctcatg acaacagtac cgaaatc 27 



<210> 16 
<211> 21 
<212> DNA 

<213> Artificial seouence 
<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 
<400> 16 

taacagacca ctaaaatgaa g 21 



<210> 17 
<211> 27 
<212> ONA 

<213> Artificial sequence 
<220> 

<2.13> Description of Artificial Sequence: Synthesized oligonucleotide 
<400> 17 

ctcgagtagc ttttccatca tcaatgt 27 



<210> 18 
<211> 33 
<212> DNA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 
<400> 18 

tctagaagat ctggtacggt tcgtaaagtt cac 33 



<210> 19 
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<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 
<400> 19 

gtctgttatg tgaccaatcs g 21 



<210> 20 
<211> 21 
<212> ONA 

<213> Artificial seauence 
<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 
<400> 20 

acagaccact aaaatgaagc c Zl 



<210> 21 
<211> 27 
<212> DNA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial Siequence: Synthesized oligonucleotide 
<400> 21 

ctcgagcctg gctggctgct tgtgcaa 27 



<210> 22 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 
<400> 22 
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tgttccacgg aaacactggt g 



21 



<210> 23 
<211> 21 
<212> ONA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 



<210> 24 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<213> Oescription of Artificial Sequence: Synthesized oligonucleotide 



<210> 25 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial Sequence: Synthesized oligonucleotide 



<400> 23 

catcctccag aactgtgtag g 



21 



<400> 24 

cacaacgtct atatcatggc eg 



22 



<400> 25 

atgttgtggc ggatcttgaa g 



21 



<210> 26 
<211> 19 
<212> DKA 



<213> Artificial sequence 
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<220> 

<213> Description of Artificial 
<400> 26 

tncargtngc naaytaygg 



<210> 27 
<2t1> 19 
<212> ONA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial 
<400> 27 

cnccnccngc nswnacrtc 



<210> 28 
<211> 25 
<212> DNA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial 
<400> 28 

gaggacggtg gctattctgt tgccg 



<210> 29 
<211> 25 
<212> DNA 

<213> Artificial sequence 
<220> 

<213> Description of Artificial 
<400> 29 

ggcatagggg gacactacga accgc 



Sequence: Synthesized oligonucleotide 

19 

Sequence: Synthesized oligonucleotide 

19 

Sequence: Synthesized oligonucleotide 

25 

Sequence: Synthesized oligonucleotide 

25 



26 
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Claims 

1 . A transformed silkwomi, which comprises a polynucleotide encoding human collagen within the genomic DNA and 
produces recombinant human collagen as a part of proteins in the cocoon or the silk gland. 

5 

2. The transformed silkworm of claim 1 further comprising a polynucleotide encoding at least one of the a subunit 
and the p subunit of prolyl hydroxylase within the genomic DNA. 

3. The transfomned silkworm of claim 2, wherein the polynucleotide encoding the prolyl hydroxylase a subunit is at 
10 least a coding region of a DNA fragment having a base sequence of SEQ ID NO: 1 . 

4. A transformed silkwomi, which comprises a polynucleotide encoding a fusion protein of human collagen with a 
silkworm silk protein within the genomic DNA and produces said fusion protein as a part of proteins in the cocoon 
or the silk gland. 

15 

5. The transfonned silkworm of claim 4 further comprising a polynucleotide encoding at least one of the a subunit 
and the p subunit of prolyl hydroxylase within the genomic DNA. 

6. The transfomned silkworm of claim 5, wherein the polynucleotide encoding the prolyl hydroxylase a subunit is at 
20 least a coding region of a DNA fragment having a base sequence of SEQ ID NO: 1 . 

7. A process for producing recombinant human collagen, which comprises isolating and purifying recombinant human 
collagen from the cocoon or the silk gland of the transformed slikwomn of claim 1 , 2 or 3. 

25 8. A process for producing recombinant human collagen, which comprises isolating a fusion protein of recombinant 
human collagen with a silkwomn silk protein from the cocoon or the silk gland of the transfonmed silkworm of claim 
4, 5 or 6, and isolating and purifying recombinant human collagen from the fusion protein. 

9. A fusion protein of recombinant human collagen with a silkworm silk protein produced by the transfonmed silkworm 
30 of claim 4, 5 or 6. 

10. A targeting vector derived from Autographs californica nuclear polyhedrosis virus, whfch comprises a polynucle- 
otide encoding human collagen . 

35 11. The targeting vector of claim 1 0, wherein the polynucleotide is a human collagen expression cassette. 

12. The targeting vector of claim 11, wherein the human collagen expression cassette is a fusion polynucleotide In- 
cluding a polynucleotide encoding human collagen ligated. under the control of an expression regulatory sequence 
of a gene for silkworm silk protein. 

40 

13. The targeting vector of claim 11, wherein the human collagen expression cassette is a fusion polynucleotide In- 
eluding a polynucleotide encoding a silkworm silk protein and a polynucleotide encoding human collagen ligated 
under the control of an expression regulatory sequence of a gene for silkworm silk protein. 

45 14. A targeting vector derived from Autograft californica nuclear polyhedrosis virus, which comprises a polynucle- 
otide encoding at least one of the a subunit and the p subunit of prolyl hydroxylase. 

15. The targeting vector of claim 14. wherein the polynucleotide encoding the prolyl hydroxylase a subunit is at least 
a coding region of a DNA fragment having a base sequence of SEQ ID NO: 1 . 

50 

16. A recombinant plasmid vector having a human collagen expression cassette in a region sandwiched between a 
pair of inverted temninal repeats of DNA transposon derived from an insect. 

17: The recombinant plasmid vector of claim 16, wherein the human collagen expression cassette is a fusion potynu- 
55 cleotide including a polynucleotide encoding human collagen ligated under the control of an expression regulatory 

sequence of a gene for silkworm silk protein. 

18. The recombinant plasmid vector of claim 16, wherein the human collagen expression cassette Is a fusion polynu- 
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cleotide including a polynucleotide encoding a silkworm silk protein and a polynucleotide encoding human collagen 
ligated under the control of an expression regulatory sequence of a gene for silkworm silk protein. 

19. A recombinant plasmid vector having a polynucleotide encoding at least one of the a subunit and the p subunit of 
prolyl hydroxylase In a region sandwiched between a pair of inverted terminal repeats of DNA transpose n derived 
from an Insect. 

20. The recombinant plasmid vector of claim 1 9. wherein the polynucleotide encoding the prolyl hydroxylase a subunit 
is at least a coding region of a DNA fragment having a base sequence of SEQ ID NO: 1 . 

21 . A set of vectors, which comprises a recombinant plasmid vector of claim 16, 1 7 or 18, and a recombinant plasmid 
vector having a polynucleotide encoding transposase of the transposon. 

22. A set pf vectors, which comprises a recombinant plasmid vector of claim 19 or 20, and a recombinant plasmid 
IS vector having a polynucleotide encoding transposase of the transposon. 

23. A polynucleotide encoding the o subunit of silkworm prolyl hydroxylase, which has the amino acid sequence of 
SEQ ID NO: 2. 

20 24. The polynucleotide of claim 23, which is a DNA fragment having a base sequence of SEQ ID NO: 1 . 



10 



25 



30 



35 



40 



45 



50 



55 
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Fig. 1 




13; 

Kpnl )3 



Sad 11.7 



Xtel 11^ 



•^^^''^Sma* 10»4/ I Sad 9.7 
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F i g. 3 




mini collagen EGFP 
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Fig. 4 



M 1 2 C 




mini collagen fragment 
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